Identifying Interesting Missing Patterns

نویسندگان

  • Bing Liu
  • Wynne Hsu
  • Hing-Yan Lee
چکیده

One of the important issues in data mining is the subjective “interestingness” problem. It has been shown that in many situations a huge number of patterns can be discovered from a database. Most of these patterns are actually useless or uninteresting to the user. But because of the huge number of patterns, it is difficult for the user to identify those patterns that are of interest to him/her. Past research proposed two main measures of subjective interestingness: unexpectedness and actionability. Both these measures focus on helping the user identify interesting discovered patterns. In this paper, we show that missing patterns (absence of some patterns) are interesting too. An approach has been proposed to identify the interesting missing patterns. The proposed approach is an extension of our previous work. In our previous work, we studied the subjective interestingness problem based on the concept of user’s expectations and fuzzy set theory. In that study, the discovered patterns are ranked in different ways according to their unexpectedness to the user. In this paper, we examine the extension to our previous work so as to identify the interesting missing patterns.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Case Report: Congenitally Missing Teeth

Congenitally missing of maxillary lateral incisors is one of the most common patterns of hypodontia. This paper presents a nine year old boy with congenital missing of lateral incisors. Familial history showed that, his mother, aunts, uncle and grandmother have also congenital absence of lateral incisors.

متن کامل

Identifying Information-Rich Subspace Trends in High-Dimensional Data

Identifying information-rich subsets in high-dimensional spaces and representing them as order revealing patterns (or trends) is an important and challenging research problem in many science and engineering applications. The information quotient of large-scale high-dimensional datasets is significantly reduced by the curse of dimensionality which makes the traditional clustering and association...

متن کامل

میزان شیوع Missing دندان مولر سوم در cl I و cl II اسکلتال در بیماران مراجعه کننده به دانشکده دندانپزشکی دانشگاه علوم پزشکی تهران در فاصله سال‌های 1380 تا 1385

Background and Aims: One of the common human evolutionary anomalies is dental Missing. Evolution of dental system is toward deduction of teeth number. The Missing of third molar is interesting subject for dentists and genetic researches because of its variety in different races. Consideration of bilateral effect of third molar Missing with jaw relation is an important subject. The aim of this s...

متن کامل

Understanding Patterns with Different Subspace Classification

By identifying characteristic regions in which classes are dense and also relevant for discrimination a new, intuitive classification method is set up. This method enables a visualized result so the user is provided with an insight into the data with respect to discrimination for an easy interpretation. Additionally, it outperforms Decision trees in a lot of situations and is robust against out...

متن کامل

Identifying the Zygosity Status of Twins Using Bayes Network and Estimation- Maximization Methodology

As the renaissance of family-based genomic research, identifying diseases and families for this kind of study becomes increasingly important. Recently, a large cohort of predicted twins, called the Marshfield Clinic Twin Cohort, has been generated by analyzing standard demographic information in an electronic medical record. However, one piece of critical information, the zygosity status (ident...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007